nsposon sequencing count data set has two types of transposon
They are the insertion statistic and the site statistic. The former
r the number of transposon insertions per gene and the latter
r the number of transposon insertion sites per gene. These two
statistics represent two types of transposon events in a genome.
tion statistic represents the total number of transposon insertions
as. This value depends on the sequencing coverage. The greater
encing coverage, the greater the insertions per gene. The site
epresents the strength of the transposon distribution across a gene.
stic depends on two factors, i.e. the sequencing coverage and the
ntiality.
2.12 shows a few lines of one replicate of the transposon
ng data of the Francisella tularensis species [Yang, et al., 2017].
t these statistics were all gene-wise summary statistics. This
at both the insertions per gene and the sites per gene have been
sed from genome-wise data.
The first few lines of the first replicate of the Francisella Tularensis species.
nds for the insertion statistic. ‘sites’ stands for the site statistic. ‘MF’ stands for
n feature statistic, which is a convolution between ‘counts’ and ‘sites’.
ID#ymbol
counts
sites
MF
FTT_0001#dnaA
402
7
1.446
FTT_0002#dnaN
280
18
2.913
FTT_0004c#isftu1
7490
159
1428.707
FTT_0005a#-
766
37
140.050
FTT_0006#-
15466
212
2695.370
The transposon insertion spectrum of the gene dnaA (FTT_0001) in the first
the Francisella Tularensis species.